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Abstract 

In order to check finite propagation speed Fermi, in 1932, had considered 
two atoms A and B separated by some distance R. At time t = 0, ^ is in an 
excited state, B in its ground state, and no photons are present. Fermi's idea 
was to calculate the excitation probability of B. In a model-independent way 
and with minimal assumptions - Hilbert space and positive energy only - it is 
proved, not just for atoms but for any systems A and B, that the excitation 
probability of B is nonzero immediately after t = 0. Possible ways out to avoid 
a contradiction to finite propagation speed are discussed. The notions of strong 
and weak Einstein causality are introduced. 



1 Introduction 

One of the pillars of special as well as general relativity is the assumption that no 
signals can be transmitted faster than the speed of light. If there were arbitrarily 
high signal velocities in nature then either 

iThis article has appeared in: NONLINEAR, DEFORMED AND IRREVERSIBLE QUANTUM 
SYSTEMS, Proceedings of an International Symposium on Mathematical Physics at the Arnold 
Sommerfeld Institute 15-19 August 1994, Clausthal, Germany. Editors: H.-D. Doebner, V.K. Do- 
brev, P. Nattermann. WORLD SCIENTIFIC, Singapore (1995), p. 253 - 264 



• those superfast signals could be used to synchronized clocks to yield absolute 
simultaneity and thus a breakdown of relativity theory, or 

• there would exist "tachyons", and the sequence of cause and effect could be 
reversed. 

The second alternative has captured imaginative minds and prompted them to 
create science-fiction like scenarios. The concept of finite signal velocity or, more 
precisely, the speed of light as highest signal velocity, is therefore often called "Einstein 
causality" . In my opinion, though, if Einstein causality were to fail most physicist 
would adopt the first alternative and reformulate or abandon relativity theory. 

For this reason the question of finite signal velocity in quantum theory attracted 
the interest of Heisenberg and Fermi in the early thirties, in particular whether pho- 
tons traveled with the speed of light. 

In 1932, Fermi [|I| consider for this purpose a simple model. Two atoms, A and B, 
are separated by a distance R. At time t = 0, A is assumed to be in excited state and 
B in its ground state, with no photons present. Atom A will decay into its ground 
state under the emission of a photon. This photon can then, with a small probability, 
be absorbed by atom B. Fermi asked the question at what earliest time atom B will 
"notice" the decay of atom A with its accompanying photon. He expected that B 
moves out of its ground state only after a time t = R/c, in accordance with the speed 
of light. And indeed, this was what he found by his calculations. 

Fermi's calculations were based on second-order perturbation theory - a technique 
still quite common today - and on the approximation of an integral over positive 
frequencies by an integral over positive and negative frequencies ranging from — oo 
to oo instead of to oo. More than thirty years later Shirokov pointed out 
that without this replacement the calculations would not yield the desired result 0. 
It remained unclear, however, what would happen if one went to higher orders in 
perturbation theory. 

The setup of Fermi's model will be further discussed in the next section. Fermi 
had calculated the probability for the following transition: A nonexcited, B excited 
and no photons. As will be discussed this is an exchange probability 0] and it does 
not directly, without further assumptions, refer to Einstein causality but to what one 
nowadays calls local and nonlocal correlations. 

Fermi's problem was investigated by many authors in this or in a related form, 
e.g. by Heitler and Ma Hamilton [Q, Fierz 0, Ferretti 0, Milonni and Knight 
10], Shirokov and his review 0], Rubin |T^, Biswas et al. [|TT|, and Valentini |]T2 



The older papers confirmed Fermi's conclusion, while the results of the later papers 
depend on the model and the approximations used. At present there seems to be 
agreement that Fermi's 'local' result is not correct, but that this nonlocality cannot 
be used for superluminal signal transmission since measurements on A and B as well 
as on photons are involved. 
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The present contribution, which is partially based on Ref. is mathematically 
very simple. It analyzes Fermi's model under quite general and simple assumptions 
- essentially just positivity of the energy. No perturbation theory, no specific form of 
the Hamiltonian, nor further assumptions of quantum field theories like the locality 
postulate, are used, and the conclusions hold for relativistic and nonrelativistic theo- 
ries. Moreover, the atoms can be replaced by more general "sources" and "detectors" . 

Specifically, it is shown for the model considered by Fermi that the excitation 
probability of atom B would be immediately nonzero if the experiment could really 
be performed. At first sight this result might seem to indicate serious difficulties with 
causality for Fermi's two-atom model. However, already in Ref. I pointed out 
several ways to avoid this disastrous consequence, and in the last section I will discuss 
additional ones. The message is that finite signal velocity is a delicate question. 

Somewhat surprisingly, the results of my paper or received great publicity and 
were discussed not only in science journals like Nature or New Scientist fl^, but 



also made it to the daily press and weekly magazines []T3[ . While some discussions were 



reasonably serious, smaller tabloids tended to sensationalize by picking on acausality 
and omitting the ways out The moral to draw from this is that one should not 
rely on second or third hand accounts, in particular not on sensational ones. 



2 Fermi's model: Correlations, Excitation Proba- 
bilities, and Bare States 

Fermi supposed in his model that by some means one had prepared, at time t = 0, 
atom A in an excited state, je^), and atom B in its ground state, Igs), with no 
photons present. The state of the complete system then developed in time and Fermi 
calculated the "exchange" probability to find the state \gA)\€-B)\Oph) at time t. He 
probably had in mind that this could occur only by deexcitation of A, emission of a 
photon by A, absorption of it by B and excitation of B. However, actually to check 
that there are no photons requires, at least in principle, photon measurement over 
all space, not only measurements of the states of A and B. Hence such an exchange 
probability cannot be used for signals, it just refers to statistical correlations. Really 
needed in this model approach to finite signal velocity is the probability of finding B 
excited, irrespective of the state of A and possible photons; if there turn out to be no 
photons, all the better. This excitation probability could then, in Fermi's approach, 
be determined by a measurement on B alone. 

Using "bare" states, as Fermi did, the Hilbert space is simply the tensor product 

'Hha.re = T^-A X 'Hb X Hp (l) 
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and the Hamiltonian is of the form 



-f^barc = Ha + Hb + Hp + Haf + Hbf ■ (2) 
At time t = 0, the initial state is 

iV'o^n = |eA)bB)|Op,) . (3) 

At time t, the probabihty of finding B in some excited state is then a sum overall 
excited states le^), over all states I^a) of A and over all photon states |{n}), i.e. 

es iA {n} 

- E KA)|eB)|{n})({n}|(eB|(.A|} l^r^) 

iA,es,{n} 

= (V'M 1a X ^ |eB)(eB| X 1^ 1^^^^) . (4) 

The r.h.s. is the expectation of the operator 

= lAX^|es)(es|xl^. (5) 

es 

This operator represents the observable is in a bare excited state" , and here it is 
a projection operator. 

3 Renormalized States 

Bare states are widely used in quantum optics and are usually quite adequate. How- 
ever, for subtle questions of principle of a physical theory great care is needed. Ap- 
proximations and perturbation theory may give misleading results. In one order an 
effect might show up, but not in the next order, and so on. Unrenormalized bare 
theories are, without cut-off, mathematically not well-defined. They are plagued by 
infinities whose cancellation has not been investigated for signal velocities. 

For the present purpose it suffices to use only rudiments of a renormalized theory. 
We just need the following two simple properties, 

(i) existence, with a Hilbert space T^ren, 

(ii) a Hamiltonian, i^ren, which is bounded from below and self-adjoint ("positive 
energy" ) . 
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Then, in general, Tiren is no longer the tensor product in Eq. (|T]), and the initial state, 
denoted by \ipo), will not be a simple product state, 

iV'o) ^ \eA)\9B)Ki,) ■ 

Similarly, if the observable 

"B is in an excited state" (6) 
makes sense and is represented by an operator Oeg then in general 

However, its expectation values must lie between and 1 to represent a probability, 

< a, < 1 . (7) 

For example, Oeg might be a projector, as for bare states, but this would not be the 
most general case. Thus 

would be the excitation probability of B at time t, and it would involve a measurement 
on B only. It should be noted that the explicit form of the operator Oe^ is not required 
in the following, only Eq. (|^) will be used. 

Also no point-like localization of A and B are required. Generalizing Fermi's 
model, A and B may be systems initially localized in two regions separated by a 
distance R, with no photons present. The ground state of B may be degenerate. 
Again, with Fermi, one would suppose that one had somehow managed to prepare 
this initial state at t = 0. The analog of Fermi's original result would then be that 
the excitation probability -Pe(^) '^^ ^ would vanish for t < R/c. 

In the next section I will prove a simple mathematical theorem which applies to 
this situation and which yields that either 

(i) p^(t) ^ for almost all t, 

or 

(ii) P^{t) = for all t . 

This result does not agree with Fermi's original expectation . In the last section I will 
show how one can by-pass potential difficulties for finite signal velocity by modifying 
and clarifying the physical assumptions employed. There is also a mathematical 
loophole which could be used, although the theorem, of course, remains true. 
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4 The Theorem 



To more clearly separate what is Physics and what Mathematics, I will phrase the 
theorem in purely mathematical terms although its main and possibly sole interest lies 
in its applications to the physical situation described above. So what was previously 
ifren HOW bccomcs any self-adjoint operator H bounded from below, and the initial 
state \ipo) can now be any state, while in the application it represents a physical 
situation in which A is supposed to be in an excited state, B in a. ground state and 
with no photons. 

Theorem. Let H be self-adjoint and bounded from below and let O be any operator 
satisfying 

< C < 1 . (8) 

Let TpQ be any vector and define 

Then one of the following two alternatives hold. 

(i) {ipt^ O ipt) 7^ for almost all t, and the set of such t's is dense and open. 

(ii) (V^j, O V^t) = for all t. 

Proof. Let us define 

Pit) = O ^t) . (9) 

Since ipt is continuous in t, so is P{t). From this it follows immediately that the set 
A/q := {t; P{t) = 0} is closed and its complement Af^ is open. Since O is a positive 
operator, its positive square-root (9^/^ exists, and one has 

For t G Afo this vanishes, and thus 

C^/Vi = for t G ATo . (10) 
Now let (f) be any fixed vector and define the auxiliary function F^{t) by 

F^(t) = (0,Oe-^^*/%). (11) 

Hence, by Eq. (|ig), 

F^{t) = for t G ATo . (12) 
Since H > — const, one has that the operator 

iH{t+iy)/h 
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is well-defined for y < 0. Putting 

z = t + iy 

one sees that F^{z) can be defined as a continuous function for Im 2; < 0, and, 
moreover, F^{z) is analytic for Im 2; < 0. 

Let us now assume that (i) does not hold, i.e. that either A/q is not a null set or 
that its complement A/'q is not dense. It would suffice to consider the former, but the 
latter can be treated in an almost elementary way, so I consider it first. If TVq is not 
dense, A/q contains some nontrivial interval, / say. Hence F^{z) vanishes on /, by 



Eq. (0), and one can directly use the Schwarz reflexion principle [17| or proceed as 



follows. One defines an extension of to the upper half plane by putting 

F^{z) = F^{z*y for Im z > . (13) 

Since F^{t) is real for t e / it follows that the extension is continuous on /, and 
from this one can show that it is analytic for z ^ 1R\L Hence / is contained in the 
analyticity domain. Since F^{z) = for z G /, it therefore vanishes identically in 
its domain of analyticity. However, since it is continuous when approaching the real 
axis, it follows that F^{t) = for all t. Since was arbitrary this implies 

O^t = for all t, 

and this gives case (ii). 

This proves the interesting part of the theorem, namely that P{t) is either nonzero 
on a dense open set or that it vanishes identically. Since a dense open set need not 
have full Lebesgue measure this does not yet prove the full theorem. However, as a 
boundary value of a bounded analytic function, F^{t) satisfies the inequality [ITB 



/oo 
dt\n\F^{t)\/{l + t^)> ~oo (14) 
-00 

unless it vanishes identically. If A/q had positive measure the integral would be —00, 
and thus F(f,{t) would vanish identically, for each 0. This would again imply case (ii). 
Incidentally, this last argument also covers the previous case since, if Ao is a null set, 
its complement is dense. Since the argument based on the Schwarz reflexion principle 
is very transparent it has been included. This completes the proof of the theorem. 

There is a similarity of this result with the Reeh-Schlieder theorem which 
also exploits analyticity but uses stronger assumptions of field theory, in particular 
locality. It is therefore not directly applicable to the general situation considered 
here. 

Taking for O in the theorem the previously considered observable Oeg and for ipo 
the state \ipo) representing the initial state with A excited, B in its ground state and 
no photons - provided they exist - one obtains from the theorem that the excitation 
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probability of B is immediately nonzero after t = - unless it vanishes for all times, 
a case one might exclude on physical grounds. 

Another application can be made to the correlations mentioned in the Introduc- 
tion. Let iV'ex) denote the state representing A in a ground state, B in an excited 
state, and no photons, either in Hren or T^bare, provided again the notion makes sense 
in the case of Tiren- In the bare case one just has 

IV'ej^bare = IS'a) ICfi) |Oph) . 

We define 

Cex = |V'ex)(^ex| • 

The expectation value of Oex, 

is just the transition probability to \ipex)- Since Oex is a projector the theorem yields 
that the transition probability is immediately nonzero, unless it vanishes identically. 
But since this is just a correlation function of measurements at different positions 
this result has no bearing on signal velocities. 

Further, more general, applications are possible. A and B can be any quantum 
systems, e.g. a "source" and detector; they may be moving. One may also envisage 
other particles and interactions. One may apply it also to a problem of Heisenberg 
who had suggested to consider an excited atom A with no photons and to calculate 
the probability to find a photon at time t in a region a distance R away. At that time 
this probability was found to vanish for t < R/c pO|] . 



5 Discussion and Ways out 



As already stressed in Ref. |T^, the theorem is a mathematically rigorous result, and 
its applications in Physics depend on the physical assumptions, leading to statements 
of the form, "If then For example, I had been careful, when introducing the 
excitation observable Oeg, to say, "if this makes sense". If it does exist, then indeed 
the excitation probability of B is immediately nonzero, or identically zero which one 
would exclude. Does this mean that atom B has been excited by a superluminal 
photon emitted by A7 Not necessarily. Before discussing this we discuss another, 
more mathematical, way out. 

Possible mathematical resolution 

An explicit calculation of transition probabilities or other quantities in quantum 
optics, or quantum electrodynamics or theories involving fields, will in general not 
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start from a renormalized theory with Hilbert space Tiren and Hamihonian H^en ^ the 
form is not known, not even the existence. Instead one will introduce cutoffs in the 
bare theory to make it well-defined, then calculate transition probabilities, and finally 
one will remove the cutoffs, taking care of divergent expressions by renormalization. 
For each cutoff the theorem may be applicable and may yield a nonzero probability for 
almost all t in t < R/c. However, as the cutoffs are tending to infinity, the nonzero 
probability in this time interval may in principle become smaller and smaller, and 
in the limit one might conceivably have for t < R/c. If this were so, then the 
mathematical assumptions - existence of Tiren, H^en, and Oeg - could not be fulfilled. 
Such a possibility, nonexistence of a Hilbert space after renormalization, has indeed 



been discussed in the literature 21 



Physical ways out 

We now discuss the more or less implicit physical assumptions that have been 
made, and how to by-pass them. 

(a) Systems localized in disjoint regions might not exist as a matter of principle, i.e. 
systems might always "overlap" . 

In ordinary quantum mechanics the wave-function associated with an energy level 
of a hydrogen atom extends to infinity, and this has been proposed before as a reason 
for overlapping [^. But since this happens in a nonrelativistic theory, this particular 
argument probably does not go to the heart of the matter. Moreover, in nonrelativis- 
tic quantum mechanics, there do exist wave-functions of the hydrogen atom which 
vanish outside some finite volume at a given fixed time. By completeness these wave- 
function can be obtained by suitable superpositions, but they spread out to infinitely 
instantaneously . 



A better argument for overlapping may seem that one might conceivably create 
particle-antiparticle pairs or other particles whenever one tries to localize a system too 
well. This has been advanced as an explanation for the difficulties one has in obtaining 



good localization or position operators ||2^, g^, g^, This is connected to the 

next point. 



(b) Renormalization may introduce a sort of photon cloud around each system, e.g. 
due to "vacuum fluctuations". This essentially means an overlapping of the systems 
with their clouds and leads back to (a). More specifically, photons of one cloud may 
excite one or the other system, and this might even happen with only one system 
present. 

(c) The notion of a ground state of B, either with or without A present, might not 
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make sense, due to renormalization. 



Strong and weak Einstein causality 

How then to check finite propagation speed? To clarify matters it is useful to differ- 
entiate between two notions of Einstein causality. 

(i) Strong Causality: For each individual process or experiment there is no excitation 
or disturbance of the second system for t < R/c. 

This notion is similar to energy - momentum and Baryon conservation in each 
individual scattering process in particle physics. Strong causality would hold if the 
transition probabilities considered above were strictly zero for t < R/c. It seems to 
me that both Fermi [jl| and Heisenberg - Kikuchi pO| had this in mind when they 



set out to prove that certain probabilities vanished for t < R/c. The above theorem 
shows that strong causality cannot be checked, unless the way out via cut-off theories 
holds, or it may fail, a possibility I do not advocate. 



(a) Weak Causality: This notion was introduced in Ref. p9l, and loosely speaking it 
means that Einstein causality holds for expectation values only, i.e. for ensembles, not 
for individual processes. For weak causality to hold, expectation values, i.e. ensemble 
averages, need not vanish for t < R/c, but it takes a time at least t = R/c to 
produce an effect on them. To exhibit this effect one may suitably subtract possible 
fluctuations of system B alone, e.g. vacuum fluctuations. 

Without additional assumptions to those of the theorem (Hilbert space, positive 
energy) probably nothing can be said about weak causality in Fermi's setup. Model 
calculations [T^, |1^ point in the right direction, although the renormalization prob- 
lem remains unsolved. "Bare" theories can sometimes give useful indications on the 
problem of weak causality, but cannot provide deflnitive answers. This is all the more 
true for bare theories with nonrelativistic atoms. 

Buchholz and Yngvason use the assumptions of the theory of local observ- 
ables ( "algebraic quantum fleld theory" ) , which are stronger than those employed for 
the above theorem. They point out that in this theory transition probabilities and 
the above observable Oeg are no legitimate quantities and thus not allowed. The 
idea is therefore to consider only observables which are allowed in the theory of lo- 
cal observables. These are observables associated with bounded space-time regions; 
in particular, sharp-time observables are excluded. To give the essential ideas, let 
\i^AB{t)) denote the state with systems A and B present, lipBit)) the state with only 
B present, and let B be an observable associated with a space-time region of system 
B. Then weak causality requires 

{^AB{t)\B\^AB{t)) = {^BitmMt)) for t<R/c; 

i.e. only the difference of both sides is zero for t < R/c. With the assumptions 
inherent in the theory of local observables this does hold 
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For simplicity let us assume that this could be applied to Fermi's question about 
the excitation probability of B (as remarked before, this is not a good observable in 
algebraic quantum field theory). This would then mean that the excitation probability 
of B could indeed be nonzero, but until t = R/c it would not depend on the presence 
of A. 

What does this mean physically? Expectation values are ensemble averages, and 
to check this nondependence on A experimentally one could not use a single pair of 
systems A and B. Instead one would need an ensemble of such pairs - either by 
repetition of the experiment or by simultaneous realization of many, say, of such 
pairs at t = 0. At time t one would then measure how many of the B systems are 
excited. Their fraction is 

PB{with A)(^) = Psit) 

if — > cxd; for finite this holds only approximately, due to statistical fluctuations. 
Now one would calculate the excitation probability of B without system A present, 
which is denoted by Pb(w/o a)('^)' subtract it. Weak causality would then assert 

Pk^m A)it) - P^Bi./o A)it) =0 for t < R/c . (15) 

Only for ^ cxD would this be strictly true experimentally since for finite A^ there 
would always be statistical fluctuations. 

Hence finite propagation velocity or speed of light in the sense of weak causality 
cannot be checked experimentally in a strict sense for finite ensembles since in this 
case there are always deviations from the exact zero in Eq. (p!5|). What is needed 
are rigorous bounds on the A^ dependence of the statistical fluctuations. The theory 
of local observables does not provide these, and rigorous model studies may be more 
promising for the question of bounds. 

In a strict sense, finite propagation speed as expressed by weak Einstein causality 
can only be checked experimentally for infinite ensembles, and this may suggest that 
this notion somehow belongs to a macroscopic context. 
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